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IN THE CLAIMS: 

The text of all pending claims, (including withdrawn claims) is set forth below. Cancelled 
and not entered claims are indicated with claim number and status only. The claims as listed 
below show added text with underlining and deleted text with strikethrough . The status of each 
claim is indicated with one of (original), (currently amended), (cancelled), (withdrawn), (new), 
(previously presented), or (not entered). 

Please AMEND claims 1, 6 and 16-21 in accordance with the following: 

1 . (currently amended) A method for collecting a docum e nt documents linked to each 
other from a network by crawling the network , comprising: 

collecting documents equal to or larger, in number, than a predetermined value from 
inside a community through the network based on a reference of the document; and 

collecting documents from inside and outside the community based on the reference of 
collected documents after collecting the documents equal to or larger in number than the 
predetermined value from inside the community. 

2. (original) The method according to claim 1 , further comprising: 

computing a significance level indicating a level of significance of the collected document 
according to the reference of the collected document, and information about a position of the 
collected document in the network; and 

determining a document to be collected based on the reference and the significance 

level. 

3. (original) The method according to claim 2, wherein said document to be collected is 
determined separately for inside the community and for outside the community. 

4. (original) The method according to claim 3 f further comprising: 
presenting a result of retrieving the collected documents separately for inside the 

community and outside the community. 

5. (original) The method according to claim 2, further comprising: 

determining whether or not the document is in the community according to information 
indicating the position of the document in the network. 
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6. (currently amended) A method for collecting a docum e nt documents linked to each 
other from a network by crawling the network , comprising: 

providing a positive sample document group which is a document group relating to a 
field, and a negative sample document group which is a document group less related to the field; 

determining a document which is to be collected and is related to the field based on a 
reference to the positive sample document group and the negative sample document group; 
and 

collecting the document to be collected from the network. 

7. (original) The method according to claim 6, further comprising: 

computing a reference score indicating a level at which a document is referenced only by 
a document in the positive sample document group based on the reference; and 

determining a document having a high reference score as the document to be collected. 

8. (currently amended) The method according to claim 6, wherein 

computing a co-reference score ind i ca i t i ng indicating a level at which a document is 
referenced together with a document in the positive sample document group for a document 
referenced by a collected document referring to a document in the positive sample document 
group based on the reference; and 

determining a document having a high co-reference score as the document to be 
collected. 

9. (original) The method according to claim 6, wherein 

said negative sample document group is a union of document groups relating to a 
plurality of fields. 

10. (original) The method according to claim 1, further comprising: 

summarizing said collected document group based on a referencing expression used in 
the collected document group. 

11. (original) The method according to claim 1, further comprising: 

assigning a keyword to the collected document based on a referencing expression used 
in the collected document. 

12. (original) The method according to claim 1, further comprising: 
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not assigning a keyword based on the referring expression when the referencing 
expression is used regardless of a content of a referenced document. 

13. (original) The method according to claim 11 , further comprising: 

counting a number of different documents referenced using the referencing expression; 

and 

not assigning the keyword based on the referencing expression when the number of 
different documents is equal to or larger than a predetermined value. 

14. (original) The method according to claim 11 , further comprising: 

counting a reference frequency at which each collected document is referenced by the 
referencing expression when the number of different documents is smaller than a predetermined 
value; and 

determining whether or not the referencing expression is assigned as the keyword based 
on the number of different documents and the reference frequency. 

15. (original) The method according to claim 11 , further comprising: 

combining the keyword based on the referencing expression with a keyword extracted 
from text of the collected document, and a keyword extracted from information indicating a 
position in the network about the collected document. 

16. (currently amended) A method for retrieving a docum e nt documents linked to each 
other from a terminal belonging to a community in a network by crawling the network , 
comprising: 

transmitting information for retrieval of the docum e nt documents to a server; and 
receiving the docum e nt documents retrieved separately from inside and outside the 

community according to the information for retrieval together with information indicating a 

significance level for the community. 

17. (currently amended) A document collection apparatus collecting a docum e nt 
documents linked to each other f rom a network by crawling the network , comprising: 

a next prospect determination unit determining a prospect to be collected next based on 
a reference of a collected document; 
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a community determination unit determining whether or not the prospect is in a 
community in the network according to information indicating a position in the network of the 
prospect; and 

a document collection unit collecting the prospect from the network, wherein said 
document collection unit collects the prospect from inside and outside the community after 
collecting documents larger in number than a predetermined value from inside the community. 

18. (currently amended) A document collection apparatus collecting a docum e nt 
documents linked to each other from a network by crawling the network , comprising: 

a next prospect determination unit determining a prospect to be collected next based on 
a reference between a positive sample document group which is a document group related to a 
field and a negative sample document group which is a document group less related tg_the field; 
and 

a document collection unit collecting the prospect from the network. 

19. (currently amended) A computer-readable recording medium recording a program 
used to direct a computer to control collection of a docum e nt documents linked to each other 
from a network by crawling the network , comprising: 

collecting documents equal to or larger, in number, than a predetermined value from a 
community through the network based on a reference of the document; and 

collecting documents from inside and outside the community based on the reference of 
collected documents after collecting the documents equal to or larger, in number, than the 
predetermined value from inside the community. 

20. (currently amended) A computer-readable recording medium recording a program 
used to direct a computer to control collection of a docum e nt documents linked to each other 
from a network by crawling the network , comprising: 

providing a positive sample document group which is a document group relating to a 
field, and a negative sample document group which is a document group less related to the field; 

determining a document to be collected relating to the field based on a reference to the 
positive sample document group and the negative sample document group; and 

collecting the document to be eeHeet -collected from the network. 

21 . (currently amended) A computer data signal embodied on a carrier expressing a 
program used to direct a computer to control collection of a docum e nt documents linked to each 
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other from a network by crawling the network , said program a ll ow i ng instructing the computer to 
perform the process comprising: 

collecting documents equal to or larger than, in number, a predetermined value from 
inside a community in the network based on a reference of the docum e nt documents ; and 

collecting documents from inside and outside the community based on the reference of 
collected documents after collecting documents equal to or larger, in number, than the 
predetermined value from the community. 
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